A mathematical theorem as basis for the second law: Thomson's formulation applied 

to equilibrium 



A.E. Allahverdyan 1 ) and Th.M. Nieuwenhuizen 2 ) 
1 ' Yerevan Physics Institute, Alikhanian Brothers St. 2, Yerevan 375036, Armenia. 
f"^ , 2 ' Institute for Theoretical Physics, University of Amsterdam 

Valckenierstraat 65, 1018 XE Amsterdam, The Netherlands. 
CN \ (February 1, 2008) 

There are several formulations of the second law, and they may, in principle, have different 
domains of validity. Here a simple mathematical theorem is proven which serves as the most general 
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i ' basis for the second law, namely the Thomson formulation ('cyclic changes cost energy'), applied 

, to equilibrium. This formulation of the second law is a property akin to particle conservation 

(normalization of the wavefunction) . ft has been stricktly proven for a canonical ensemble, and 
made plausible for a micro-canonical ensemble. 

As the derivation does not assume time-inversion-invariance, it is applicable to situations where 
persistent current occur. This clear-cut derivation allows to revive the "no perpetuum mobile in 
equilibrium" formulation of the second law and to criticize some assumptions which are widespread 
in literature. 

The result puts recent results devoted to foundations and limitations of the second law in proper 
perspective, and structurizes this relatively new field of research. 

a 
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a. 

I . I. INTRODUCTION 

The second law is undoubtedly one the most known statements of statistical thermodynamics [|l|. Its most known 
formulation is 'the entropy of a closed system cannot decrease'. Despite of its important role in the modern science 
— or may be even due to this role — its typical formulations arc frequently folklore- minded and not very explicit. 
After all, what is precisely meant by entropy? Moreover, the law is rarely formulated rigorously 0]. This has led 
to a pertinent opinion that the second law is an empiric relation which is supported by observations, and at least 
not inconsistent with the formalism of quantum physics. This situation is especially unfortunate, since the absence 
of explicit formulations of the second law makes it difficult to study its generalizations or to limit its domain of 
applicability in extreme (quantum) conditions . This became additionally complicated by the fact that the most 
typical formulations of the second law use the concept of entropy, which is a context-dependent quantity and which is 
7-H ' frequently not observed directly. Indeed, the standard definition dS — &Q /T is only an identification of the measured 
heat with a change in the thermodynamic entropy; 'measuring' entropy can be done in numerics if one determines 
the fraction of time that states are visited (7) , but other definitions of entropy occur as well || . All by all this led 
to a disappointing situation, where far less known and less important subjects of statistical physics received much 
attention, while the second law itself still keeps its not very explicit and vague look. The situation became acute, when 
we discovered that several formulations of the second law (Clausius inequality, positivity of energy dispersion and 
entropy production) are violated in the standard model for quantum brownian motion, which is a harmonic quantum 
O ■ particle coupled to a bath of harmonic oscillators Mpl . 

In the present paper we try to bridge this gap, and restate that the second law of thermodynamics — as formulated 
. by Thomson — is just a rigorous theorem of quantum mechanics, comparable to particle conservation (normalization 
of the wavefunction) . Standard quantum mechanics completely suffices for derivation of the theorem and its adequate 
interpretation. The Thomson formulation — and this is its main advantage over all other formulations — uses the 
unambiguous and well-defined concept of work. In contrast to entropy, work is a relatively straightforward quantity, 
and its use does not assume any particular caution. The proposed clear-cut Thomson formulation will allow us to 
establish a connection between the third and second laws, to analyze certain opinions expressed in literature about the 
second law, as well as to put into the proper perspective the recent attempts towards identification of limits of validity 
for the second law Q] . From the mathematical viewpoint the presented results are not completely new, since the main 
theorem appeared with a different, more complicated proof in works of Pusz and Woronowicz J9J and Lenard [fj0| . 
The purposes of these authors were quite different from ours, since they used the theorem as an argument towards 
describing the quantum equilibrium state through the Gibbs distribution. 
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For a general, pedagogic text on the history and today's status of thermodynamics and the second law, we refer to 
the recent work by Uffink . For a collection and discussion of the original papers, see the book by Kestin . A very 
recent discussion of the second law within the axiomatic thermodynamics was presented by Lieb and Yngvason [jl3| . 
A dialogue on the some of the definitions of entropy was reproduced by Maes and Lebowitz ||. 

The setup of this paper is as follows. In section II we will derive the theorem for the quantum mechanical situation 
and in section III we consider the derivation for the classical case. In section IV we close with a discussion. 



II. QUANTUM MECHANICAL PROOF OF THOMSON'S FORMULATION IN EQUILIBRIUM 

Here we shall present a general proof of Thomson's formulation of the second law as applied to equilibrium: No 
work can be extracted from a closed equilibrium system during a cyclic variation of a parameter by an external source. 

The idea of the following derivation was given by Lenard jlO| . He was adopting to the physical language a more 
general proof given in |}). This last proof is fairly difficult for the average physically-minded reader, since it uses the 
techniques of C*-algebras. As a by-product of our present consideration we will significantly simplify the original 
derivation of Lenard. 

A closed quantum statistical system is considered. The dynamics is described by the Hamiltonian TLq. At the 
moment t — an external time-dependent field is switched on, and the Hamiltonian becomes Ti{t). This field 
represents the influence of an external, deterministic source. The field is switched off at the moment t, and the 
Hamiltonian will be again H.q. Thus we have a cyclic variation of a parameter with at least one period. Neither the 
explicit character of this parameter, nor the Hamiltonians Tto and Tt(t) have to be specified. It is only assumed that 
initially, before the variation has started, the system was in the equilibrium state described by the Gibbs distribution: 

P -I3n 

p(0) = ^ r , Z = tre-"*«, (1) 

where (3 = l/T is the positive inverse temperature. In the time-interval t the source of the external field does work 
on the system. Since the system is closed before and after the variation, the work is equal to the difference between 
the final and initial energies: 

W = ti{H [p(t)-p(0)}}. (2) 

It can also be written alternatively as 



ds ' 

where one uses integration by parts, and the equation of motion: 



W = [ ds tr[p(i 
Jo 



(3) 



iK^- = [H(t),p{t)]. (4) 
Let us now go to the interaction representation and introduce a unitary operator V as 

p(t) = e im °' n Vp(0) V ] e - ltn ° /h . (5) 

Eq. (|J) now reads 

W = tr[H Q Vp(0)V^}-tr[H Q p(0)]. (6) 

It is seen that as far as the work is concerned, any cyclic variation enters only through its corresponding unitary 
operator V. Our aim now is to show that W defined by (JsJ) is nonnegative, or in other words, the final average energy 
is not smaller than the initial one. Notice especially that we compare only the average energies. 

Due to Eq. (Q) p(0) and Hq commute, and thus have a common eigenbasis \k). Let us denote eigenvalues of p(0) as 
{rfe}, and those oiTto as {hk}- It holds that r^ — exp(—(3hk)/Z. For simplicity we will consider a finite dimensional 
Hilbert space. One has 

n 

(7) 

m. k— 1 
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where n is the dimension of the corresponding Hilbert space, and v m k — {m\V\k)(k\V^\m) . Since V is unitary, 
VV' = V'V = 1, it follows that v m k is double-stochastic: 

n n 

V m k > 0, ^2 V mk = ^ v mk = 1- (8) 

m=l fc=l 

One arranges the ft. m in a non-decreasing order 

hi < h 2 < < h n , (9) 

which implies (due to the fact that the exponential function is monotonic) that the r m are arranged as 

n > r 2 > > r n > 0. (10) 



The work (0) reads in these variables 



where s m is defined as 



W h m s m - ^ h m r m (11) 



S m = /.Vmkrk- (12) 
fe=l 

Now we employ a summation by parts (the discrete analog of integration by parts) 

n n — 1 m n 

^ = - X] - E Si + hn E Sfc ' ( 13 <* 

m— 1 m=l i— 1 fc— 1 

and the same with r m replacing s m , to obtain 

n— 1 m n n— 1 m 

W = ^ (hm+i - h m )'^2(r i - Si) + h n )^(sk - Tk) = ^ (h m +i - h m ) }Xn - s») (14) 

m— 1 i= 1 m— 1 i— 1 

In the second step Eq. ^ was used. To prove that W > 0, notice that /i m +i — ft m > 0. Therefore it suffices to show 
that 

rn m 
i=l i=l 

Hereto one denotes 0^ = X^'li which has the properties 

fc=i 

as follows from rtq). One then gets 



o<0i m) <i, £4 ro) = ™, as) 



Eh-*)=E r '-Ec ,f * = B 1 -c , H- E n (17) 

i=l z—1 fc=l fe=l fe=m+l 

Now using the ordering ( JToj ) of the r^, one gets the lower bound 

m m n / n \ 

5>< - Sl )> £(i - 4 m V- - E 4 ro) r m = m - E 4 m) r ™ = °. ( 18 ) 

i=l fe=l fc=m+l \ fc=l / 

where the last step follows because of Eq. (|l6|). Therefore Eq. (plf ) has now been proven. Inserting this in Eq. jl^ ) 
one finally has 

W > (19) 

This derivation concludes the proof of Thomson's formulation of the second law for this case: from a system in the 
equilibrium state work cannot be extracted in a cyclic process. The inequality sign says that work can nevertheless 
be done on the system, as is physically obvious. 

At zero temperature the equilibrium state is the ground state. The inequality W > is then obvious without any 
derivation, and confirms that no work can be extracted from the ground state. 
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III. MICROCANONICAL INITIAL DISTRIBUTION 



The above analysis assumes that the initial state of the system is the Gibbs distribution. In the present section we 
shortly consider some other distributions of statistical physics. First let us notice that we only used two properties 
of the Gibbs distribution: commutation with the initial Hamiltonian and the opposite ordering of the corresponding 



eigenvalues, as given by Eqs. (|9, 10). Thus, the no work-extraction principle: W > 0, is valid for all initial distributions 
which satisfy these properties. As a particular case, we mention the generalized microcanonical distribution or 9- 
distribution Q 

r k = — 9(m - k), 1 < k < n, 1 < m < n, (20) 
m 

where Q(k) is step function: 9(x > 0) = 1, 9(x < 0) = 0. Thus all energy levels below a fixed level h m are equally 
populated, while the energy levels larger than h m are not populated. The monotonicity properties (^|, [h]) are obviously 
satisfied, so that W > is also valid for the present case. 

For the strictly micro-canonical ensemble one considers an energy shell {£ — d<?,£), which is a group of energy 
levels such that the difference d£ between the maximal and the minimal energy level of the shell is smaller than a 
characteristic uncertainty of energy JlJ]. Let the total number of levels within the shell be f2. The states within the 
shell d£ are equally probable, 

r k = i (21) 

for any level hk belonging to the shell, while for other levels one has — 0. It is seen that the arrangement of r^'s is 
non-monotonous as soon as the shell is located above the vacuum, i.e. if the minimal energy of the shell is higher than 
the vacuum energy. For this case a straightforward application of the above theorem is impossible. Let us see where 
precisely our proof fails. The simplest situation of this kind is a shell which consists of one single non- vacuum energy 
level. For simplicity suppose that we are trying to check Eq. ( |l5| ) with a distribution r% — 0, r2 — r%.... = r n > (this 
is a shell with n — 1 levels, which just starts one level above the vacuum). As expected, Eq. (|l5| ) can be violated for 
m = 1 

n 

ri- Sl = -J2<t>k } rk<0. (22) 

k=2 

More generally, a negative contribution to the work arises from systems that, due to the cyclic process, end up in 
energy levels below the shell. As a result, the theorem cannot hold in full generality, and may not apply, e.g., to small 
microcanonical systems. 

The above arguments are simple enough to convince us that a proper formulation of the second law for the micro- 
canonical ensemble should be connected with certain limitations. As already discussed, one way is to require that the 
shell is so wide that it includes the vacuum state, and then one has the ^-distribution or generalized microcanonical 
distribution; the validity of W > was shown above. Another way is to consider only those unitary operations 
which do not bring system to energies less than the lower shell-limit £ — d£ ; under this condition the theorem again 
applies, since the dangerous terms (those with energies below the shell) have vanishing matrix element v m k- For large 
systems it is well known that almost all states are very near the maximal limit £ of the shell. Let us suppose that 
for a macroscopic system with N degrees of freedom, £ is proportional to N. The shell thickness d£ has to be much 
smaller than the typical uncertainty y/N of the energy. Let us choose d£ ~ N a with a < |. Now given the fact that 
almost all systems of the ensemble have energy very close to the upper bound £, extraction of energy less than N a 
is ruled out for almost all members of the ensemble. On physical grounds one then expects that extraction of more 
energy is also very unlikely. This means that the central inequality W > holds for all practical purposes in the 
micro-canonical ensemble. 



IV. CLASSICAL PROOF OF THOMSON'S FORMULATION. 



The above quantum result remains valid if the spectrum becomes dense, so the classical case is included into the 
consideration. Nevertheless, for the interested reader we will briefly outline the proof of the second law, when starting 
immediately from the classical formalism. Here the state of the system is described through the probability density 
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V(p, g, t) as a function of time t, the canonical momentum p, and the canonical coordinate q (in fact p and q can be 
arbitrary dimensional vectors; since the generalization to this case is obvious, it will not be discussed by us separately). 
The initial Hamiltonian and the time-dependent Hamiltonian are still denoted by Ho and TC(t), respectively. The 
evolution of V is described by the Liouville equation: 

d{P{t)=L{t)V{t), V{t) = Te$l dsC{s) V{<S), (23) 



r e /» ds£(s) EHVfd Sl fd S2 ... /' ds k C( Sk )...C( 



_ dH(p,q;t) d &H(p,q;t) d , x , , , , , ... , 

£{t) = — — — — , Te-ia y '=l+y^j dsi / ds 2 ... / ds k L(s k )...C{si), (24) 



where £ is the Liouville operator and T is the chronological symbol as defined above. For a cyclic variation the work 
in the classical situation reads: 



W = J dpdqH O (p,q)[V(p,q,t)-r(p,q,0)]. (25) 
Eq. ( p3| ) can also be written in the integral form: 

V(p,q,t) = [dp'dq'V(p',q',0)lC(p,q;t\p',q';0), K(p, q; t\p\ q'- 0) = tX ds C[s) S(p - p') S(q - q'). (26) 
Now the work done by the external source reads: 

W= f dpdqdp'dq'H (p,q)V(p',q',0)IC(p,q;t\p',q';0)~ f dpdq H (p, q) P(p, q, 0). (27) 



The analogy with Eqs. (^, is by now fully obvious. In particular, the role of the discrete indexes i and k in those 
equations is now played by the continuous double-indices (p, q) and (p', q'); the role of v ik is played by JC(p, q; t\p', q'; 0). 
Due to its definition (p6|), K.(p, q; t\p', q'; 0) does have the standard properties of the conditional probability distribution, 
and one additional property which makes it a double-stochastic (continuous) matrix: 

fC{p,q;t\p',q';0)>0, J dpdq fC{p, q; t\p', q'; 0) = J dp' dq' fC{p, <?; t\p', q'; 0) = 1. (28) 

The only non-trivial property is the last one, but it quite clearly follows from (^4], |2^) upon noting that £ is a 
differential operator and integrals similar to J dp' dq' C(si)...C(s k )S(p — p') 5{q — q') are equal to zero. 

Once the property of the double-stochasticity and the essential similarity between (|^, |^) and (27) is established, it 
is a matter of repetition to derive the proof of 

W > (29) 

in the classical case. The reader should notice that by saying this we ignore all convergence problems which can arise 
due to the continuous character of the considered classical situation. The most reasonable way to overcome such 
problems is to introduce an additional regularization. However, we will leave the situation as it is, since the readers 
who are sensitive to this kind of problems are just invited to get the classical situation as the limiting case of the 
above quantum proof (after all, the quantum formulation is the physical way to regularize the classical problem). 



V. DISCUSSION 



In classical physics there are many equivalent formulations of the second law. Examples are: non-decrease of 
entropy of a closed system, heat goes from high temperature to low temperature, the Clausius inequality dQ < TdS , 
non-negativity of the rate of entropy production and non-negativity of the rate of energy dispersion. A more folkore 
minded example is the absence of steady currents. In recent studies of quantum systems, several of these formulations 
have been questioned |^-^,^,^|. The fundamental question is then whether there is still a unique formulation that is 
satisfied in all cases. The aim of this paper is to demonstrate that there indeed exists such a formulation, and it is 
related with work, which, fortunately, is more accessible than heat and certainly more accessible than entropy, for 
which there are many definitions H. 
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In the present section we will discuss the above theorem and its relations with the standard understanding of the 
second law, as well as we outline some relatively straightforward applications of the above theorem. 

It is clear that the theorem forbids even one single work extraction cycle. This is to be put in contrast with the 
following known version of the second law: No perpetuum mobile of the second kind (i.e., a device which makes as 
many work-extracting cycles as one pleases) exists. It is a particular case of our theorem, but we now point out that 
it is in fact much more weaker, as its validity depends only on the existence of a ground state. This ground state 
does not have even have to be unique, as required for the validity of the third law [jD. Indeed, starting from any state 
and making sufficiently many work-extracting cycles with a finite extracted work per cycle, one will decrease the final 
average energy of the system below its ground state energy, which is impossible. So already a clear-cut formulation 
of the statement allows us to unmask the above no perpetuum mobile statement as a basically trivial consequence of 
quantum mechanics, rather than a deep theorem on (quantum) statistical physics. Our new theorem (even one cycle 
is forbidden) heals the problem, by forbidding 'perpetuum mobile' with any finite number of cycles, at least as long 
as one starts in equilibrium. 

When proving the above theorem we did not use any special property of the initial Gibbs distribution, except for 
its commutation with the initial Hamiltonian and the opposite ordering of the corresponding eigenvalues (see Eqs. (||) 
and (flo|)). If the initial distribution is Gibbsian but the temperature is negative, this ordering property is lost, and 
the theorem does not hold. This explains the role played by negative temperature for lasers and masers |15|, where 
positive work extraction is the main state of affairs. However, for other initial distributions that do satisfy these 
properties, the derivation applies as well. The most interesting case is the generalized micro-canonical ensemble or 
0-ensemble, where all states below a given energy are equally probable. In typical situations a vast majority of the 
states have energy very close to the maximal energy, implying that, at least in statics, this generalized ensemble is 
equivalent to the micro-canonical ensemble itself. The same property puts forward that also our theorem applies for 
all practical purposes to the micro-canonical ensemble. 

Yet another line of generalization arises when one is noting that the features of the Hamiltonian TIq under time- 
inversion were irrelevant for the proof. Thus, the studied system may well contain an external magnetic field. In such a 
situation the system can contain persistent currents in the equilibrium state. Examples are Landau diamagnetism [jp, 
vortices in conventional superconductors |[(|, that may last days, boundary currents in the quantum Hall effect and 
persistent currents in mesoscopic rings. These effects are pretty counterintuitive from the classical thcrmodynamical 
viewpoint, and at the first glance may even appear as a violation of the second law. In particular, one of the widespread 
folkore-minded formulations of the second law refers to the impossibility of ongoing motion in the equilibrium state, 
and the persistent currents give an example of such a motion. Nevertheless, it does not imply any contradiction with 
the second law in the Thomson formulation, and also shows that for the present case the time-inversion invariance 
does not have any direct connection with the second law. Notice in this context that the second law in the equilibrium 
Thomson formulation was proven under the condition of time- inversion- invariance (see |l8|Jl9| ] and refs. therein). Since 
the invariance property is rather strong, the authors of these works got somewhat more detailed results than just the 
non-negativity of the work W > 0. Whether these results are valid for the considered more general case is still an 
open problem. 

We like to stress that our theorem also applies when the total closed system consists of a subsystem and a heat 
bath, that interact with each other. In that situation the typical case is that a work cycle is made by manipulating a 
parameter of the subsytem. This is the situation considered in Ref. |^|, and it could be checked that, when starting 
from equilibrium, the total work for making a cyclic change is always positive. 

Let us notice that, after one cycle has been made, the system can locally return to equilibrium. Then surplus of 
energy runs away (dissipates) in the bath. When this process has settled, additional cycles cost additional work. 
Employing a standard argument, we can now show that non-cyclic changes, that are made in such a manner that 
afterwards one waits long enough to erase memory effects, also disperse energy. Indeed, by closing the cycle, there 
should always be dispersion, and this is only possible in all cases if each part disperses energy. 

Finally, we would like to analyze two widespread opinions about the second law. In their book pj Landau and 
Lifshitz state that the second law is incompatible with the microscopically reversible quantum dynamics, and that 
the second law can somehow be connected with the quantum measurement process, which in view of these authors 
is an inherently irreversible process imposed on the reversible quantum formalism. As we see above, no quantum 
measurement process is directly involved into the derivation of the second law, and the standard quantum-mechanical 
formalism is completely enough. Moreover, the dynamics of the system is unitary, i.e. it is invertible as precisely as 
one wishes, so that no arrow of time is involved in the presented derivation of the second law. 

Within another school of thinking, Zurek and his coworker |l7| | claim that the second law does arise as a consequence 
of the interaction between a quantum system and its thermal environment (environment-induced superselection rules). 
This is again not supported by the above proof, since it does not suppose the existence of a thermal environment, 
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although such a case is not excluded, provided that the system and its environment are considered within one closed 
system. Of course, this remark does not mean that the second law has nothing to do with thermal environments in 
general. They are just not necessary for the rigorous statement of the Thomson formulation applied to the equilibrium 
state of a closed system. 

In conclusion, we have analyzed a mathematical theorem which serves as a basis for the derivation of the second 
law in the Thomson formulation. Once this clear-cut derivation is given, it is a matter of a simple logic to rule out 
some pertinent pre-supposes on the second law. In particular, we analyzed the "no perpetuum mobile" principle, 
which within the quantum theory was seen to be almost a trivial statement akin (and even weaker) to the third law. 
It is hoped that the present paper will put into the proper perspective the research devoted to the microscopical 
foundations and limitations of the second law |^|-^) , since it is absolutely necessary to have a rigorous formulation of 
this law within the quantum statistical thermodynamics before consideration of its limits and its generalizations. From 
our viewpoint, neither the folklore-minded statements typically encountered in textbooks, nor rigorous derivations 
within the axiomatic (formal) thermodynamics fully meet this goal. 
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